Poincaré-Map-Based Reinforcement Learning For Biped Walking
نویسندگان
چکیده
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Viapoints are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of the Poincaré map of the periodic walking pattern. The model maps from a state in the single support phase and the control actions to a state in the next single support phase. We applied this approach to both a simulated robot model and an actual biped robot. We show that successful walking policies are acquired.
منابع مشابه
Nonparametric representation of an approximated Poincaré map for learning biped locomotion
We propose approximating a Poincaré map of biped walking dynamics using Gaussian processes. We locally optimize parameters of a given biped walking controller based on the approximated Poincaré map. By using Gaussian processes, we can estimate a probability distribution of a target nonlinear function with a given covariance. Thus, an optimization method can take the uncertainty of approximated ...
متن کاملAcquisition of a Biped Walking Policy Using an Approximated Poincaré Map
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately place the swing leg. This decision is based on a learned model of the Poincaré map of the periodic walking pattern. The model maps from a state at a single support phase and foot placement to a state at the next single support phase. We applied this approach to both a simulated...
متن کاملAnalysis of 3D Passive Walking Including Turning Motions for the Finite-width Rimless Wheel
The focus of studies in the field of passive walking has often been on straight walking, while less attention has been paid to the field of turning motions. In this paper, the passive motions of a finite width rimless wheel as the simplest 3D model of passive biped walkers was investigated with a focus on turning motions. For this purpose, the hybrid model of the system consisting of continuous...
متن کاملA Simple Reinforcement Learning Algorithm For Biped Locomotion
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately place the swing leg. This decision is based on a learned model of the Poincare map of the periodic walking pattern. The model maps from a state at the middle of a step and foot placement to a state at next middle of a step. We also modify the desired walking cycle frequency bas...
متن کاملStable Gait Planning and Robustness Analysis of a Biped Robot with One Degree of Underactuation
In this paper, stability analysis of walking gaits and robustness analysis are developed for a five-link and four-actuator biped robot. Stability conditions are derived by studying unactuated dynamics and using the Poincaré map associated with periodic walking gaits. A stable gait is designed by an optimization process satisfying physical constraints and stability conditions. Also, considering...
متن کامل